Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Paul Feng

AXA REV Lausanne

Field typing for improved recognition on heterogeneous handwritten forms

Sep 23, 2019

Ciprian Tomoiaga, Paul Feng, Mathieu Salzmann, Patrick Jayet

Figure 1 for Field typing for improved recognition on heterogeneous handwritten forms

Figure 2 for Field typing for improved recognition on heterogeneous handwritten forms

Figure 3 for Field typing for improved recognition on heterogeneous handwritten forms

Figure 4 for Field typing for improved recognition on heterogeneous handwritten forms

Abstract:Offline handwriting recognition has undergone continuous progress over the past decades. However, existing methods are typically benchmarked on free-form text datasets that are biased towards good-quality images and handwriting styles, and homogeneous content. In this paper, we show that state-of-the-art algorithms, employing long short-term memory (LSTM) layers, do not readily generalize to real-world structured documents, such as forms, due to their highly heterogeneous and out-of-vocabulary content, and to the inherent ambiguities of this content. To address this, we propose to leverage the content type within an LSTM-based architecture. Furthermore, we introduce a procedure to generate synthetic data to train this architecture without requiring expensive manual annotations. We demonstrate the effectiveness of our approach at transcribing text on a challenging, real-world dataset of European Accident Statements.

Via

Access Paper or Ask Questions